Using Prior Information from the Medical Literature in GWAS of Oral Cancer Identifies Novel Susceptibility Variant on Chromosome 4 - the AdAPT Method

نویسندگان

  • Mattias Johansson
  • Angus Roberts
  • Dan Chen
  • Yaoyong Li
  • Manon Delahaye-Sourdeix
  • Niraj Aswani
  • Mark A. Greenwood
  • Simone Benhamou
  • Pagona Lagiou
  • Ivana Holcátová
  • Lorenzo Richiardi
  • Kristina Kjaerheim
  • Antonio Agudo
  • Xavier Castellsagué
  • Tatiana V. Macfarlane
  • Luigi Barzan
  • Cristina Canova
  • Nalin S. Thakker
  • David I. Conway
  • Ariana Znaor
  • Claire M. Healy
  • Wolfgang Ahrens
  • David Zaridze
  • Neonilia Szeszenia-Dabrowska
  • Jolanta Lissowska
  • Eleonóra Fabiánová
  • Ioan Nicolae Mates
  • Vladimir Bencko
  • Lenka Foretova
  • Vladimir Janout
  • Maria Paula Curado
  • Sergio Koifman
  • Ana Menezes
  • Victor Wünsch-Filho
  • Jose Eluf-Neto
  • Paolo Boffetta
  • Silvia Franceschi
  • Rolando Herrero
  • Leticia Fernandez Garrote
  • Renato Talamini
  • Stefania Boccia
  • Pilar Galan
  • Lars Vatten
  • Peter Thomson
  • Diana Zelenika
  • Mark Lathrop
  • Graham Byrnes
  • Hamish Cunningham
  • Paul Brennan
  • Jon Wakefield
  • James D. Mckay
چکیده

BACKGROUND Genome-wide association studies (GWAS) require large sample sizes to obtain adequate statistical power, but it may be possible to increase the power by incorporating complementary data. In this study we investigated the feasibility of automatically retrieving information from the medical literature and leveraging this information in GWAS. METHODS We developed a method that searches through PubMed abstracts for pre-assigned keywords and key concepts, and uses this information to assign prior probabilities of association for each single nucleotide polymorphism (SNP) with the phenotype of interest--the Adjusting Association Priors with Text (AdAPT) method. Association results from a GWAS can subsequently be ranked in the context of these priors using the Bayes False Discovery Probability (BFDP) framework. We initially tested AdAPT by comparing rankings of known susceptibility alleles in a previous lung cancer GWAS, and subsequently applied it in a two-phase GWAS of oral cancer. RESULTS Known lung cancer susceptibility SNPs were consistently ranked higher by AdAPT BFDPs than by p-values. In the oral cancer GWAS, we sought to replicate the top five SNPs as ranked by AdAPT BFDPs, of which rs991316, located in the ADH gene region of 4q23, displayed a statistically significant association with oral cancer risk in the replication phase (per-rare-allele log additive p-value [p(trend)] = 2.5×10(-3)). The combined OR for having one additional rare allele was 0.83 (95% CI: 0.76-0.90), and this association was independent of previously identified susceptibility SNPs that are associated with overall UADT cancer in this gene region. We also investigated if rs991316 was associated with other cancers of the upper aerodigestive tract (UADT), but no additional association signal was found. CONCLUSION This study highlights the potential utility of systematically incorporating prior knowledge from the medical literature in genome-wide analyses using the AdAPT methodology. AdAPT is available online (url: http://services.gate.ac.uk/lld/gwas/service/config).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Novel Bi-allelic PDE6C Variant Leads to Congenital Achromatopsia

Background: The clinical phenotyping of patients with achromatopsia harboring variants in phosphordiesterase 6C (PDE6C) has poorly been described in the literature. PDE6C encodes the catalytic subunit of the cone phosphodiesterase, which hydrolyzes the cyclic guanosine monophosphate that proceeds with the hyperpolarization of photoreceptor cell membranes, as the final step of the phototransduct...

متن کامل

Direct Bisulfite Sequencing and Methylation Specific PCR to Detect Methylation of p15INK4b and F7 genes in Coronary Artery Disease Patients

Genome-Wide Association Studies (GWAS) have identified genetic variants contributing to the risk of cardiovascular disease (CVD) at the chromosome 9p21 locus. The chromosome 9p21 is an important susceptibility locus for several multifactorial diseases like ischemic stroke, aortic aneurysm, type 2 diabetes mellitus and coronary artery disease (CAD). F7 gene because of its role in activating the ...

متن کامل

Homozygosity Mapping and Targeted Sanger Sequencing Identifies Three Novel CRB1 (Cumbs homologue 1) Mutations in Iranian Retinal Degeneration Families

Background: Inherited retinal diseases (IRDs) are a group of genetic disorders with high degrees of clinical, genetic and allelic heterogeneity. IRDs generally show progressive retinal cell death resulting in gradual vision loss. IRDs constitute a broad spectrum of disorders including retinitis pigmentosa and Leber congenital amaurosis. In this study, we performed genotyping studies to identify...

متن کامل

Genetics of Type 2 Diabetes- A Review Article

Objective: Type 2 diabetes (T2D) as a complex disease is the result of genetically heterogeneous factors and environmental issues interaction. Linkage and small-scale candidate gene studies were successful in identification of genetic susceptibilities of monogenic form of diseases. However, they were largely unsuccessful while applying to the more common forms of disease. By designing Genome Wi...

متن کامل

VSEAMS: a pipeline for variant set enrichment analysis using summary GWAS data identifies IKZF3, BATF and ESRRA as key transcription factors in type 1 diabetes

MOTIVATION Genome-wide association studies (GWAS) have identified many loci implicated in disease susceptibility. Integration of GWAS summary statistics (P-values) and functional genomic datasets should help to elucidate mechanisms. RESULTS We extended a non-parametric SNP set enrichment method to test for enrichment of GWAS signals in functionally defined loci to a situation where only GWAS ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 7  شماره 

صفحات  -

تاریخ انتشار 2012